BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 021493
(311 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9FNY7|OGG1_ARATH N-glycosylase/DNA lyase OGG1 OS=Arabidopsis thaliana GN=OGG1 PE=1
SV=1
Length = 365
Score = 426 bits (1096), Expect = e-119, Method: Compositional matrix adjust.
Identities = 215/312 (68%), Positives = 244/312 (78%), Gaps = 13/312 (4%)
Query: 11 LKRPRLTPQP------------PPTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQ 58
+KRPR T QP P TP Q L KW PL LT +EL+LPLTFPTGQ
Sbjct: 1 MKRPRPTSQPSISSTVKPPLSPPVTPILKQKLHRTGTPKWFPLKLTHTELTLPLTFPTGQ 60
Query: 59 TFRWKKTGPLQYTGPIGPHLISLKHLQNGD-VCYHIHTSPSEPAAKSALLDFLNMGISLG 117
TFRWKKTG +QY+G IGPHL+SL+ D V Y +H S S +A+ ALLDFLN ISL
Sbjct: 61 TFRWKKTGAIQYSGTIGPHLVSLRQRPGDDAVSYCVHCSTSPKSAELALLDFLNAEISLA 120
Query: 118 ELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGS 177
ELW FS D RF ELA++L GARVLRQDP+ECL+QFLCSSNNNIARITKMVDF++SLG
Sbjct: 121 ELWSDFSKKDPRFGELARHLRGARVLRQDPLECLIQFLCSSNNNIARITKMVDFVSSLGL 180
Query: 178 HLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRAKYITGTVDVLQSKHSGGAEWLLSL 237
HLG+++GFEFH+FPSL+RLS VSE E R AGFGYRAKYITGTV+ LQ+K GG EWLLSL
Sbjct: 181 HLGDIDGFEFHQFPSLDRLSRVSEEEFRKAGFGYRAKYITGTVNALQAKPGGGNEWLLSL 240
Query: 238 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVRL 297
RK++LQEA+ ALCTLPGVGPKVAACIALFSLDQH AIPVDTHVW+IAT YLLP+LAG +L
Sbjct: 241 RKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDTHVWQIATNYLLPDLAGAKL 300
Query: 298 TPKLCSRVAEAF 309
TPKL RVAEAF
Sbjct: 301 TPKLHGRVAEAF 312
>sp|O15527|OGG1_HUMAN N-glycosylase/DNA lyase OS=Homo sapiens GN=OGG1 PE=1 SV=2
Length = 345
Score = 195 bits (496), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 101/262 (38%), Positives = 149/262 (56%), Gaps = 8/262 (3%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 264
R G GYRA+Y++ + + + GG WL LR+ +EA ALC LPGVG KVA CI
Sbjct: 197 RKLGLGYRARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHKALCILPGVGTKVADCIC 255
Query: 265 LFSLDQHHAIPVDTHVWKIATR 286
L +LD+ A+PVD H+W IA R
Sbjct: 256 LMALDKPQAVPVDVHMWHIAQR 277
>sp|O08760|OGG1_MOUSE N-glycosylase/DNA lyase OS=Mus musculus GN=Ogg1 PE=2 SV=2
Length = 345
Score = 192 bits (487), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 104/272 (38%), Positives = 155/272 (56%), Gaps = 9/272 (3%)
Query: 22 PTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISL 81
P+ +TL+++ P+ W + +SEL L L +GQ+FRWK+ P ++G + + +L
Sbjct: 8 PSSMRHRTLSSS-PALWASIPCPRSELRLDLVLASGQSFRWKEQSPAHWSGVLADQVWTL 66
Query: 82 KHLQNGDVCYHIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKY 136
++ C S+ + + L + + +SL +L+ +++ D F +A+
Sbjct: 67 TQTEDQLYCTVYRGDDSQVSRPTLEELETLHKYFQLDVSLAQLYSHWASVDSHFQRVAQK 126
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLER 195
G R+LRQDP ECL F+CSSNNNIARIT MV+ L + G L ++ +H FP+L
Sbjct: 127 FQGVRLLRQDPTECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPNLHA 186
Query: 196 LSL-VSEVELRNAGFGYRAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPG 254
L+ +E LR G GYRA+Y+ + + + GG WL LR +EA ALCTLPG
Sbjct: 187 LAGPEAETHLRKLGLGYRARYVRASAKAILEEQ-GGPAWLQQLRVAPYEEAHKALCTLPG 245
Query: 255 VGPKVAACIALFSLDQHHAIPVDTHVWKIATR 286
VG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 246 VGAKVADCICLMALDKPQAVPVDVHVWQIAHR 277
>sp|O70249|OGG1_RAT N-glycosylase/DNA lyase OS=Rattus norvegicus GN=Ogg1 PE=2 SV=1
Length = 345
Score = 189 bits (479), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 100/262 (38%), Positives = 146/262 (55%), Gaps = 8/262 (3%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
T+ P+ W + +SEL L L +GQ+FRW++ P ++G + + +L ++ C
Sbjct: 17 TSSPALWASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEDQLYCT 76
Query: 92 HIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
+ + L + + +SL +L+ +++ D F +A+ G R+LRQD
Sbjct: 77 VYRGDKGQVGRPTLEELETLHKYFQLDVSLTQLYSHWASVDSHFQSVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P ECL F+CSSNNNIARIT MV+ L + G L ++ +H FP+L L+ E L
Sbjct: 137 PTECLFSFICSSNNNIARITGMVERLCQAFGPRLVQLDDVTYHGFPNLHALAGPEVETHL 196
Query: 205 RNAGFGYRAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIA 264
R G GYRA+Y+ + + + GG WL LR +EA ALCTLPGVG KVA CI
Sbjct: 197 RKLGLGYRARYVCASAKAILEEQ-GGPAWLQQLRVASYEEAHKALCTLPGVGTKVADCIC 255
Query: 265 LFSLDQHHAIPVDTHVWKIATR 286
L +LD+ A+PVD HVW+IA R
Sbjct: 256 LMALDKPQAVPVDIHVWQIAHR 277
>sp|Q9V3I8|OGG1_DROME N-glycosylase/DNA lyase OS=Drosophila melanogaster GN=Ogg1 PE=2
SV=2
Length = 343
Score = 151 bits (382), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 94/282 (33%), Positives = 150/282 (53%), Gaps = 19/282 (6%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKT---GPLQYTGPIGPHLISLKHLQNGDVCYHIHTSP 97
+ L+ E L T GQ+FRW+ +Y G + L+ + + Y + +
Sbjct: 27 IGLSLEECDLERTLLGGQSFRWRSICDGNRTKYGGVVFNTYWVLQQ-EESFITYEAYGTS 85
Query: 98 SEPAAK---SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG-ARVLRQDPVECLLQ 153
S A K S + D+L + L + + + D F K+L+ R+L Q+P E +
Sbjct: 86 SPLATKDYSSLISDYLRVDFDLKVNQKDWLSKDDNFV---KFLSKPVRLLSQEPFENIFS 142
Query: 154 FLCSSNNNIARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLV----SEVELRNAG 208
FLCS NNNI RI+ M++ F A+ G+ +G+ G + + FP++ R + +LR A
Sbjct: 143 FLCSQNNNIKRISSMIEWFCATFGTKIGHFNGADAYTFPTINRFHDIPCEDLNAQLRAAK 202
Query: 209 FGYRAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 268
FGYRAK+I T+ +Q K GG W +SL+ + ++A + L LPG+G KVA CI L S+
Sbjct: 203 FGYRAKFIAQTLQEIQKK--GGQNWFISLKSMPFEKAREELTLLPGIGYKVADCICLMSM 260
Query: 269 DQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 309
++PVD H+++IA Y LP L G + +T K+ V++ F
Sbjct: 261 GHLESVPVDIHIYRIAQNYYLPHLTGQKNVTKKIYEEVSKHF 302
>sp|P53397|OGG1_YEAST N-glycosylase/DNA lyase OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=OGG1 PE=1 SV=1
Length = 376
Score = 139 bits (351), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 91/268 (33%), Positives = 143/268 (53%), Gaps = 20/268 (7%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRW---KKTGPLQYTGPIGPH----LISLKHLQNGDV 89
K+ L + +SEL L GQ+FRW +K T IG ++ L+ + ++
Sbjct: 4 KFGKLAINKSELCLANVLQAGQSFRWIWDEKLNQYSTTMKIGQQEKYSVVILRQDEENEI 63
Query: 90 CYHIHTSP--SEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQD 146
+ ++ A K+ L+ + + +SL L++ + SD FA+L+ G R+L Q+
Sbjct: 64 LEFVAVGDCGNQDALKTHLMKYFRLDVSLKHLFDNVWIPSDKAFAKLSP--QGIRILAQE 121
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVEL 204
P E L+ F+CSSNNNI+RIT+M + L S G+ + ++G +H FP+ E L S +E +L
Sbjct: 122 PWETLISFICSSNNNISRITRMCNSLCSNFGNLITTIDGVAYHSFPTSEELTSRATEAKL 181
Query: 205 RNAGFGYRAKYITGTV-----DVLQSKHSGGAEWLLSL-RKLDLQEAIDALCTLPGVGPK 258
R GFGYRAKYI T D ++ + +L S+ + ++ + L + GVGPK
Sbjct: 182 RELGFGYRAKYIIETARKLVNDKAEANITSDTTYLQSICKDAQYEDVREHLMSYNGVGPK 241
Query: 259 VAACIALFSLDQHHAIPVDTHVWKIATR 286
VA C+ L L +PVD HV +IA R
Sbjct: 242 VADCVCLMGLHMDGIVPVDVHVSRIAKR 269
>sp|O27397|OGG1_METTH Probable N-glycosylase/DNA lyase OS=Methanothermobacter
thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM
10044 / NBRC 100330 / Delta H) GN=MTH_1342 PE=3 SV=1
Length = 312
Score = 77.0 bits (188), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 81/174 (46%), Gaps = 28/174 (16%)
Query: 133 LAKYLAGARVLR----QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFH 188
L+ L +R LR +DP EC++ + S+N ++ R T+ ++ + L G FH
Sbjct: 92 LSYTLDSSRGLRLFLAKDPFECVISSIASANCSVVRWTRSIEDIRRLWGQANTFNGETFH 151
Query: 189 EFPSLERLSLVSE-------------------VELRNAGFGYRAKYITGTVDVLQSKHSG 229
FPS L+ V+E +LR+ G GYRA YI T +L +
Sbjct: 152 TFPSPHVLTGVAEGSLEDLQRAEDNLPSDFSFNDLRSCGVGYRAPYIRETSRILAEEMD- 210
Query: 230 GAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 283
+ + +D +A + L L GVGPKVA CI L+ + A PVD + +I
Sbjct: 211 ----IRRIDGMDYDDARELLLELSGVGPKVADCILLYGFRKTEAFPVDVWIRRI 260
>sp|P54137|NTH1_CAEEL Probable endonuclease III homolog OS=Caenorhabditis elegans
GN=nth-1 PE=1 SV=2
Length = 298
Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 71/157 (45%), Gaps = 27/157 (17%)
Query: 132 ELAKYLAGARVLR-QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEF 190
+LA LA V R Q V +L N A + ++ D S+G L EF +
Sbjct: 55 KLADPLAAPPVHRFQVLVALMLSSQTRDEVNAAAMKRLKDHGLSIGKIL------EF-KV 107
Query: 191 PSLERLSLVSEVELRNAGFGYR-AKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDAL 249
P LE + L GF R A Y+ T +L+ SG D+ +++D L
Sbjct: 108 PDLETI-------LCPVGFYKRKAVYLQKTAKILKDDFSG-----------DIPDSLDGL 149
Query: 250 CTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 286
C LPGVGPK+A + + + I VDTHV +I+ R
Sbjct: 150 CALPGVGPKMANLVMQIAWGECVGIAVDTHVHRISNR 186
>sp|O35980|NTHL1_MOUSE Endonuclease III-like protein 1 OS=Mus musculus GN=Nthl1 PE=2 SV=1
Length = 300
Score = 45.1 bits (105), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 212 RAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 271
+ KYI T +LQ ++ G D+ ++ L LPGVGPK+A +
Sbjct: 173 KVKYIKQTTAILQQRYEG-----------DIPASVAELVALPGVGPKMAHLAMAVAWGTI 221
Query: 272 HAIPVDTHVWKIATR 286
I VDTHV +IA R
Sbjct: 222 SGIAVDTHVHRIANR 236
>sp|Q2KID2|NTHL1_BOVIN Endonuclease III-like protein 1 OS=Bos taurus GN=NTHL1 PE=2 SV=1
Length = 305
Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 212 RAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 271
+ KYI T +LQ ++ G D+ ++ L LPGVGPK+A +
Sbjct: 178 KVKYIKQTSAILQQRYDG-----------DIPASVAELVALPGVGPKMAHLAMAVAWGTV 226
Query: 272 HAIPVDTHVWKIATR 286
I VDTHV +IA R
Sbjct: 227 SGIAVDTHVHRIANR 241
>sp|Q9SJQ6|ROS1_ARATH Protein ROS1 OS=Arabidopsis thaliana GN=ROS1 PE=1 SV=2
Length = 1393
Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 52/96 (54%), Gaps = 8/96 (8%)
Query: 195 RLSLVSEVE--LRNAGFGYR-AKYITGTVDVLQSKH-SGGAEWLLSLRKLDLQEAIDALC 250
R + V EV +++ G ++ A+ I G +D L + H S EWL R + +A + L
Sbjct: 889 RAADVKEVAETIKSRGMNHKLAERIQGFLDRLVNDHGSIDLEWL---RDVPPDKAKEYLL 945
Query: 251 TLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 286
+ G+G K C+ L +L H A PVDT+V +IA R
Sbjct: 946 SFNGLGLKSVECVRLLTL-HHLAFPVDTNVGRIAVR 980
>sp|P78549|NTHL1_HUMAN Endonuclease III-like protein 1 OS=Homo sapiens GN=NTHL1 PE=1 SV=2
Length = 312
Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 37/75 (49%), Gaps = 11/75 (14%)
Query: 212 RAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 271
+ KYI T +LQ +H GG D+ ++ L LPGVGPK+A +
Sbjct: 185 KVKYIKQTSAILQ-QHYGG----------DIPASVAELVALPGVGPKMAHLAMAVAWGTV 233
Query: 272 HAIPVDTHVWKIATR 286
I VDTHV +IA R
Sbjct: 234 SGIAVDTHVHRIANR 248
>sp|O49498|DML3_ARATH DEMETER-like protein 3 OS=Arabidopsis thaliana GN=DML3 PE=2 SV=2
Length = 1044
Score = 40.8 bits (94), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 249 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 286
L + G+G K A C+ L L +HHA PVDT+V +IA R
Sbjct: 584 LLEIEGIGLKSAECVRLLGL-KHHAFPVDTNVGRIAVR 620
>sp|P31378|NTG1_YEAST Mitochondrial DNA base excision repair N-glycosylase 1
OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
GN=NTG1 PE=1 SV=1
Length = 399
Score = 39.7 bits (91), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 55/132 (41%), Gaps = 22/132 (16%)
Query: 154 FLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVEL----RNAGF 209
L S + M++ + L + EG +LE + ++E +L + GF
Sbjct: 151 MLSSQTKDEVTAMAMLNIMRYCIDELHSEEGM------TLEAVLQINETKLDELIHSVGF 204
Query: 210 GYR-AKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 268
R AKYI T +LQ + S D+ I+ L LPGVGPK+A +
Sbjct: 205 HTRKAKYILSTCKILQDQFSS-----------DVPATINELLGLPGVGPKMAYLTLQKAW 253
Query: 269 DQHHAIPVDTHV 280
+ I VD HV
Sbjct: 254 GKIEGICVDVHV 265
>sp|Q8LK56|DME_ARATH Transcriptional activator DEMETER OS=Arabidopsis thaliana GN=DME PE=1
SV=2
Length = 1987
Score = 39.7 bits (91), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 54/120 (45%), Gaps = 7/120 (5%)
Query: 170 DFLASLGSHLGNVEGFEFHEFPSLERLSL--VSEVELRNAGFGYRAKYITGTVDVLQSKH 227
D + G N + ++ ++ R S+ +SE A I ++ + H
Sbjct: 1456 DVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLERIVKDH 1515
Query: 228 SG-GAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 286
G EWL R+ +A D L ++ G+G K C+ L +L + A PVDT+V +IA R
Sbjct: 1516 GGIDLEWL---RESPPDKAKDYLLSIRGLGLKSVECVRLLTL-HNLAFPVDTNVGRIAVR 1571
>sp|Q9SR66|DML2_ARATH DEMETER-like protein 2 OS=Arabidopsis thaliana GN=DML2 PE=3 SV=2
Length = 1332
Score = 39.3 bits (90), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 213 AKYITGTVDVLQSKH-SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 271
A+ I ++ L KH S EWL R + +A + L ++ G+G K C+ L SL Q
Sbjct: 842 AERIKAFLNRLVKKHGSIDLEWL---RDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQ- 897
Query: 272 HAIPVDTHVWKIATR 286
A PVDT+V +IA R
Sbjct: 898 IAFPVDTNVGRIAVR 912
>sp|Q09907|NTH1_SCHPO Endonuclease III homolog OS=Schizosaccharomyces pombe (strain 972 /
ATCC 24843) GN=nth1 PE=1 SV=1
Length = 355
Score = 38.1 bits (87), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 44/101 (43%), Gaps = 14/101 (13%)
Query: 184 GFEFHEFPSLERLSLVSEVELRNAGFGYRAK-YITGTVDVLQSKHSGGAEWLLSLRKLDL 242
G + +++ +SL +E GF R Y+ +L K G D+
Sbjct: 80 GLCLEDIQNIDEVSLNKLIE--KVGFHNRKTIYLKQMARILSEKFQG-----------DI 126
Query: 243 QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 283
+ ++ L TLPGVGPK+ + ++ I VD HV +I
Sbjct: 127 PDTVEDLMTLPGVGPKMGYLCMSIAWNKTVGIGVDVHVHRI 167
>sp|Q10630|ALKA_MYCTU Probable bifunctional transcriptional activator/DNA repair enzyme
AlkA OS=Mycobacterium tuberculosis GN=alkA PE=3 SV=1
Length = 496
Score = 37.4 bits (85), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 49/121 (40%), Gaps = 14/121 (11%)
Query: 171 FLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRAKYITGTVDVLQSKH--- 227
+A+ G + + G H FPS+E+L+ + L A R + I V L K
Sbjct: 342 LVAAYGRPVHDRHGALTHTFPSIEQLAEIDPGHL--AVPKARQRTINALVASLADKSLVL 399
Query: 228 SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRY 287
G +W Q A L LPGVGP A IA+ L A P ++A +
Sbjct: 400 DAGCDW---------QRARGQLLALPGVGPWTAEVIAMRGLGDPDAFPASDLGLRLAAKK 450
Query: 288 L 288
L
Sbjct: 451 L 451
>sp|Q8SRB8|NTH1_ENCCU Endonuclease III homolog OS=Encephalitozoon cuniculi (strain GB-M1)
GN=NTH1 PE=1 SV=1
Length = 238
Score = 36.6 bits (83), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 34/70 (48%)
Query: 237 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR 296
LR+ L + L +LPG+G K+A + ++ I VDTHV +I+ R L V
Sbjct: 128 LREKGLPREMKDLISLPGIGNKMALLYMSHACNRTVGISVDTHVHRISNRIGLVRTRDVE 187
Query: 297 LTPKLCSRVA 306
T + RV
Sbjct: 188 STRRELERVV 197
>sp|Q6AB99|RECR_PROAC Recombination protein RecR OS=Propionibacterium acnes (strain
KPA171202 / DSM 16379) GN=recR PE=3 SV=1
Length = 201
Score = 34.7 bits (78), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 18/28 (64%), Positives = 19/28 (67%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
LQE IDAL LPG+GPK A IA LD
Sbjct: 6 LQELIDALSRLPGIGPKGAQRIAFHILD 33
>sp|P44319|END3_HAEIN Endonuclease III OS=Haemophilus influenzae (strain ATCC 51907 / DSM
11121 / KW20 / Rd) GN=nth PE=3 SV=1
Length = 211
Score = 34.7 bits (78), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 12/78 (15%)
Query: 209 FGYRAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSL 268
F +A+ I T L KH+G ++ E +AL L GVG K A + L +
Sbjct: 82 FNSKAENIIKTCRDLIEKHNG-----------EVPENREALEALAGVGRKTAN-VVLNTA 129
Query: 269 DQHHAIPVDTHVWKIATR 286
H I VDTH++++ R
Sbjct: 130 FGHPTIAVDTHIFRVCNR 147
>sp|Q92383|MAG1_SCHPO DNA-3-methyladenine glycosylase 1 OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=mag1 PE=1 SV=1
Length = 228
Score = 33.5 bits (75), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 24/98 (24%), Positives = 44/98 (44%), Gaps = 8/98 (8%)
Query: 203 ELRNAGF------GYRAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVG 256
E+R+ F G+ A+ I + ++ SG +L +E I+ L + G+G
Sbjct: 89 EIRDMDFEIMRACGFSARKIDSLKSIAEATISGLIPTKEEAERLSNEELIERLTQIKGIG 148
Query: 257 PKVAACIALFSLDQHHAIPVDTHVWKIATRYL--LPEL 292
+ +FSL++ +P D + RYL LP++
Sbjct: 149 RWTVEMLLIFSLNRDDVMPADDLSIRNGYRYLHRLPKI 186
>sp|A0R567|MUTY_MYCS2 A/G-specific adenine glycosylase OS=Mycobacterium smegmatis (strain
ATCC 700084 / mc(2)155) GN=mutY PE=3 SV=1
Length = 293
Score = 32.0 bits (71), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 246 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 286
+D L TLPG+G A +A F+ Q VDT+V ++ TR
Sbjct: 106 VDTLLTLPGIGAYTARAVACFAY-QASVPVVDTNVRRVVTR 145
>sp|B7GFF3|RECR_ANOFW Recombination protein RecR OS=Anoxybacillus flavithermus (strain
DSM 21510 / WK1) GN=recR PE=3 SV=1
Length = 198
Score = 32.0 bits (71), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVD 277
+ + ID+ LPG+GPK A +A F L+ + +D
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLNMKEDVVLD 42
>sp|P73715|END3_SYNY3 Endonuclease III OS=Synechocystis sp. (strain PCC 6803 / Kazusa)
GN=nth PE=3 SV=1
Length = 219
Score = 32.0 bits (71), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 32/147 (21%), Positives = 55/147 (37%), Gaps = 30/147 (20%)
Query: 145 QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERLSLVSEVEL 204
Q PV+ L+ + S+ R+ K+ L F +P L+ E+
Sbjct: 36 QTPVQLLVATILSAQCTDERVNKVTPAL--------------FQRYPDANALAYGDRQEI 81
Query: 205 RN-----AGFGYRAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKV 259
F +AK I G + + G ++ + ++ L TLPGV K
Sbjct: 82 EELIHSTGFFRNKAKNIQGACRKIVEEFDG-----------EVPQRMEELLTLPGVARKT 130
Query: 260 AACIALFSLDQHHAIPVDTHVWKIATR 286
A + + + VDTHV +++ R
Sbjct: 131 ANVVLAHAFGILAGVTVDTHVKRLSQR 157
>sp|Q4JSL5|RECR_CORJK Recombination protein RecR OS=Corynebacterium jeikeium (strain
K411) GN=recR PE=3 SV=1
Length = 250
Score = 32.0 bits (71), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 3/43 (6%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIA 284
LQ+ ID LPG+GPK A IAL L++ P D ++ A
Sbjct: 6 LQDVIDEFSRLPGIGPKSAQRIALHLLNEE---PEDIERFQSA 45
>sp|A7GJT7|RECR_BACCN Recombination protein RecR OS=Bacillus cereus subsp. cytotoxis
(strain NVH 391-98) GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|Q6HPZ0|RECR_BACHK Recombination protein RecR OS=Bacillus thuringiensis subsp.
konkukian (strain 97-27) GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|Q63HE8|RECR_BACCZ Recombination protein RecR OS=Bacillus cereus (strain ZK / E33L)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|Q81JB9|RECR_BACCR Recombination protein RecR OS=Bacillus cereus (strain ATCC 14579 /
DSM 31) GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B9IYI9|RECR_BACCQ Recombination protein RecR OS=Bacillus cereus (strain Q1) GN=recR
PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B7HPT6|RECR_BACC7 Recombination protein RecR OS=Bacillus cereus (strain AH187)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B7HIJ3|RECR_BACC4 Recombination protein RecR OS=Bacillus cereus (strain B4264)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|C1ES27|RECR_BACC3 Recombination protein RecR OS=Bacillus cereus (strain 03BB102)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B7IS39|RECR_BACC2 Recombination protein RecR OS=Bacillus cereus (strain G9842)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|Q73FI4|RECR_BACC1 Recombination protein RecR OS=Bacillus cereus (strain ATCC 10987)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B7JJD6|RECR_BACC0 Recombination protein RecR OS=Bacillus cereus (strain AH820)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|A0R899|RECR_BACAH Recombination protein RecR OS=Bacillus thuringiensis (strain Al
Hakam) GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|Q81W16|RECR_BACAN Recombination protein RecR OS=Bacillus anthracis GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|C3LIZ7|RECR_BACAC Recombination protein RecR OS=Bacillus anthracis (strain CDC 684 /
NRRL 3495) GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|C3P9H0|RECR_BACAA Recombination protein RecR OS=Bacillus anthracis (strain A0248)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|A9VN38|RECR_BACWK Recombination protein RecR OS=Bacillus weihenstephanensis (strain
KBAB4) GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLD 269
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B0TXX4|SYP_FRAP2 Proline--tRNA ligase OS=Francisella philomiragia subsp.
philomiragia (strain ATCC 25017) GN=proS PE=3 SV=1
Length = 565
Score = 31.2 bits (69), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 34/82 (41%), Gaps = 13/82 (15%)
Query: 46 SELSLPLTFPT---GQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCYH-IHTSPSEPA 101
SEL LP P+ +T RW K GP L+ LK D CY H P
Sbjct: 67 SELLLPSVLPSELLQETHRWDK---------FGPELLKLKDRHERDFCYGPTHEEPIVDM 117
Query: 102 AKSALLDFLNMGISLGELWEGF 123
A+ + + + ++L ++ F
Sbjct: 118 ARDTIKSYKQLPLNLYQIQTKF 139
>sp|A0L4V9|RUVA_MAGSM Holliday junction ATP-dependent DNA helicase RuvA OS=Magnetococcus
sp. (strain MC-1) GN=ruvA PE=3 SV=1
Length = 204
Score = 31.2 bits (69), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 28/54 (51%)
Query: 243 QEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR 296
+E + LCT+PGVG K A +A+ D+ A+P+ I + AG+R
Sbjct: 104 REDLTLLCTIPGVGKKTAQRMAMELKDKLGALPMAAPTTAIGAATMAANPAGLR 157
>sp|A8FAE6|RECR_BACP2 Recombination protein RecR OS=Bacillus pumilus (strain SAFR-032)
GN=recR PE=3 SV=1
Length = 198
Score = 31.2 bits (69), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 13/36 (36%), Positives = 20/36 (55%)
Query: 242 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVD 277
+ + ID+ LPG+GPK A +A F L + +D
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLSMQEDVVLD 42
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.136 0.420
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 122,725,777
Number of Sequences: 539616
Number of extensions: 5216601
Number of successful extensions: 29623
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 54
Number of HSP's successfully gapped in prelim test: 70
Number of HSP's that attempted gapping in prelim test: 28695
Number of HSP's gapped (non-prelim): 823
length of query: 311
length of database: 191,569,459
effective HSP length: 117
effective length of query: 194
effective length of database: 128,434,387
effective search space: 24916271078
effective search space used: 24916271078
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (28.1 bits)