BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 020295
(328 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9FNY7|OGG1_ARATH N-glycosylase/DNA lyase OGG1 OS=Arabidopsis thaliana GN=OGG1 PE=1
SV=1
Length = 365
Score = 416 bits (1070), Expect = e-115, Method: Compositional matrix adjust.
Identities = 215/329 (65%), Positives = 244/329 (74%), Gaps = 30/329 (9%)
Query: 11 LKRPRLTPQP------------PPTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQ 58
+KRPR T QP P TP Q L KW PL LT +EL+LPLTFPTGQ
Sbjct: 1 MKRPRPTSQPSISSTVKPPLSPPVTPILKQKLHRTGTPKWFPLKLTHTELTLPLTFPTGQ 60
Query: 59 TFRWKKTGPLQYTGPIGPHLISLKHLQNGD-VCYHIHTSPSEPAAKSALLDFLNMGISLG 117
TFRWKKTG +QY+G IGPHL+SL+ D V Y +H S S +A+ ALLDFLN ISL
Sbjct: 61 TFRWKKTGAIQYSGTIGPHLVSLRQRPGDDAVSYCVHCSTSPKSAELALLDFLNAEISLA 120
Query: 118 ELWEGFSASDCRFAELAKYLAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGS 177
ELW FS D RF ELA++L GARVLRQDP+ECL+QFLCSSNNNIARITKMVDF++SLG
Sbjct: 121 ELWSDFSKKDPRFGELARHLRGARVLRQDPLECLIQFLCSSNNNIARITKMVDFVSSLGL 180
Query: 178 HLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV 237
HLG+++GFEFH+FPSL+RLS VSE E R AGFGYR AKYITGTV
Sbjct: 181 HLGDIDGFEFHQFPSLDRLSRVSEEEFRKAGFGYR-----------------AKYITGTV 223
Query: 238 DVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
+ LQ+K GG EWLLSLRK++LQEA+ ALCTLPGVGPKVAACIALFSLDQH AIPVDTHV
Sbjct: 224 NALQAKPGGGNEWLLSLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDTHV 283
Query: 298 WKIATRYLLPELAGVRLTPKLCSRVAEAF 326
W+IAT YLLP+LAG +LTPKL RVAEAF
Sbjct: 284 WQIATNYLLPDLAGAKLTPKLHGRVAEAF 312
>sp|O15527|OGG1_HUMAN N-glycosylase/DNA lyase OS=Homo sapiens GN=OGG1 PE=1 SV=2
Length = 345
Score = 185 bits (469), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
+ P+ W + +SEL L L P+GQ+FRW++ P ++G + + +L + C
Sbjct: 17 ASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQTEEQLHCT 76
Query: 92 HIHTSPSE-----PAAKSALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
S+ P A+ + + ++L +L+ + + D F E+A+ G R+LRQD
Sbjct: 77 VYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P+ECL F+CSSNNNIARIT MV+ L + G L ++ +H FPSL+ L+ E L
Sbjct: 137 PIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y++ + + + GG WL LR+ +EA
Sbjct: 197 RKLGLGYR-----------------ARYVSASARAILEEQ-GGLAWLQQLRESSYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALC LPGVG KVA CI L +LD+ A+PVD H+W IA R
Sbjct: 239 ALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQR 277
>sp|O08760|OGG1_MOUSE N-glycosylase/DNA lyase OS=Mus musculus GN=Ogg1 PE=2 SV=2
Length = 345
Score = 181 bits (459), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 155/289 (53%), Gaps = 26/289 (8%)
Query: 22 PTPPNPQTLTTNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISL 81
P+ +TL+++ P+ W + +SEL L L +GQ+FRWK+ P ++G + + +L
Sbjct: 8 PSSMRHRTLSSS-PALWASIPCPRSELRLDLVLASGQSFRWKEQSPAHWSGVLADQVWTL 66
Query: 82 KHLQNGDVCYHIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKY 136
++ C S+ + + L + + +SL +L+ +++ D F +A+
Sbjct: 67 TQTEDQLYCTVYRGDDSQVSRPTLEELETLHKYFQLDVSLAQLYSHWASVDSHFQRVAQK 126
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLER 195
G R+LRQDP ECL F+CSSNNNIARIT MV+ L + G L ++ +H FP+L
Sbjct: 127 FQGVRLLRQDPTECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPNLHA 186
Query: 196 LSL-VSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSL 254
L+ +E LR G GYR A+Y+ + + + GG WL L
Sbjct: 187 LAGPEAETHLRKLGLGYR-----------------ARYVRASAKAILEEQ-GGPAWLQQL 228
Query: 255 RKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
R +EA ALCTLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 229 RVAPYEEAHKALCTLPGVGAKVADCICLMALDKPQAVPVDVHVWQIAHR 277
>sp|O70249|OGG1_RAT N-glycosylase/DNA lyase OS=Rattus norvegicus GN=Ogg1 PE=2 SV=1
Length = 345
Score = 177 bits (450), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 146/279 (52%), Gaps = 25/279 (8%)
Query: 32 TNKPSKWTPLNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGDVCY 91
T+ P+ W + +SEL L L +GQ+FRW++ P ++G + + +L ++ C
Sbjct: 17 TSSPALWASIPCPRSELRLDLVLASGQSFRWREQSPAHWSGVLADQVWTLTQTEDQLYCT 76
Query: 92 HIHTSPSEPAAKS-----ALLDFLNMGISLGELWEGFSASDCRFAELAKYLAGARVLRQD 146
+ + L + + +SL +L+ +++ D F +A+ G R+LRQD
Sbjct: 77 VYRGDKGQVGRPTLEELETLHKYFQLDVSLTQLYSHWASVDSHFQSVAQKFQGVRLLRQD 136
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLA-SLGSHLGNVEGFEFHEFPSLERLSLVS-EVEL 204
P ECL F+CSSNNNIARIT MV+ L + G L ++ +H FP+L L+ E L
Sbjct: 137 PTECLFSFICSSNNNIARITGMVERLCQAFGPRLVQLDDVTYHGFPNLHALAGPEVETHL 196
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAID 264
R G GYR A+Y+ + + + GG WL LR +EA
Sbjct: 197 RKLGLGYR-----------------ARYVCASAKAILEEQ-GGPAWLQQLRVASYEEAHK 238
Query: 265 ALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
ALCTLPGVG KVA CI L +LD+ A+PVD HVW+IA R
Sbjct: 239 ALCTLPGVGTKVADCICLMALDKPQAVPVDIHVWQIAHR 277
>sp|Q9V3I8|OGG1_DROME N-glycosylase/DNA lyase OS=Drosophila melanogaster GN=Ogg1 PE=2
SV=2
Length = 343
Score = 140 bits (353), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 95/298 (31%), Positives = 149/298 (50%), Gaps = 34/298 (11%)
Query: 41 LNLTQSELSLPLTFPTGQTFRWKKTGPLQYTGPIGPHLISLKHLQNGD--VCYHIHTSPS 98
+ L+ E L T GQ+FRW+ T G + LQ + + Y + + S
Sbjct: 27 IGLSLEECDLERTLLGGQSFRWRSICDGNRTKYGGVVFNTYWVLQQEESFITYEAYGTSS 86
Query: 99 EPAAK---SALLDFLNMGISLGELWEGFSASDCRFAELAKYLAG-ARVLRQDPVECLLQF 154
A K S + D+L + L + + + D F K+L+ R+L Q+P E + F
Sbjct: 87 PLATKDYSSLISDYLRVDFDLKVNQKDWLSKDDNFV---KFLSKPVRLLSQEPFENIFSF 143
Query: 155 LCSSNNNIARITKMVD-FLASLGSHLGNVEGFEFHEFPSLERLSLV----SEVELRNAGF 209
LCS NNNI RI+ M++ F A+ G+ +G+ G + + FP++ R + +LR A F
Sbjct: 144 LCSQNNNIKRISSMIEWFCATFGTKIGHFNGADAYTFPTINRFHDIPCEDLNAQLRAAKF 203
Query: 210 GYRSAPQSSLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTL 269
GYR AK+I T+ +Q K GG W +SL+ + ++A + L L
Sbjct: 204 GYR-----------------AKFIAQTLQEIQKK--GGQNWFISLKSMPFEKAREELTLL 244
Query: 270 PGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR-LTPKLCSRVAEAF 326
PG+G KVA CI L S+ ++PVD H+++IA Y LP L G + +T K+ V++ F
Sbjct: 245 PGIGYKVADCICLMSMGHLESVPVDIHIYRIAQNYYLPHLTGQKNVTKKIYEEVSKHF 302
>sp|P53397|OGG1_YEAST N-glycosylase/DNA lyase OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=OGG1 PE=1 SV=1
Length = 376
Score = 129 bits (323), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 143/285 (50%), Gaps = 37/285 (12%)
Query: 37 KWTPLNLTQSELSLPLTFPTGQTFRW---KKTGPLQYTGPIGPH----LISLKHLQNGDV 89
K+ L + +SEL L GQ+FRW +K T IG ++ L+ + ++
Sbjct: 4 KFGKLAINKSELCLANVLQAGQSFRWIWDEKLNQYSTTMKIGQQEKYSVVILRQDEENEI 63
Query: 90 CYHIHTSP--SEPAAKSALLDFLNMGISLGELWEG-FSASDCRFAELAKYLAGARVLRQD 146
+ ++ A K+ L+ + + +SL L++ + SD FA+L+ G R+L Q+
Sbjct: 64 LEFVAVGDCGNQDALKTHLMKYFRLDVSLKHLFDNVWIPSDKAFAKLSP--QGIRILAQE 121
Query: 147 PVECLLQFLCSSNNNIARITKMVDFLAS-LGSHLGNVEGFEFHEFPSLERL-SLVSEVEL 204
P E L+ F+CSSNNNI+RIT+M + L S G+ + ++G +H FP+ E L S +E +L
Sbjct: 122 PWETLISFICSSNNNISRITRMCNSLCSNFGNLITTIDGVAYHSFPTSEELTSRATEAKL 181
Query: 205 RNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTV-----DVLQSKHSGGAEWLLSL-RKLD 258
R GFGYR AKYI T D ++ + +L S+ +
Sbjct: 182 RELGFGYR-----------------AKYIIETARKLVNDKAEANITSDTTYLQSICKDAQ 224
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
++ + L + GVGPKVA C+ L L +PVD HV +IA R
Sbjct: 225 YEDVREHLMSYNGVGPKVADCVCLMGLHMDGIVPVDVHVSRIAKR 269
>sp|O27397|OGG1_METTH Probable N-glycosylase/DNA lyase OS=Methanothermobacter
thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM
10044 / NBRC 100330 / Delta H) GN=MTH_1342 PE=3 SV=1
Length = 312
Score = 68.9 bits (167), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 83/175 (47%), Gaps = 13/175 (7%)
Query: 133 LAKYLAGARVLR----QDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFH 188
L+ L +R LR +DP EC++ + S+N ++ R T+ ++ + L G FH
Sbjct: 92 LSYTLDSSRGLRLFLAKDPFECVISSIASANCSVVRWTRSIEDIRRLWGQANTFNGETFH 151
Query: 189 EFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFK---QAKYITGTVDVLQSKHS 245
FPS L+ V+E L + + P S F+ RS +A YI T +L +
Sbjct: 152 TFPSPHVLTGVAEGSLEDLQRAEDNLP-SDFSFNDLRSCGVGYRAPYIRETSRILAEEMD 210
Query: 246 GGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
+ + +D +A + L L GVGPKVA CI L+ + A PVD + +I
Sbjct: 211 -----IRRIDGMDYDDARELLLELSGVGPKVADCILLYGFRKTEAFPVDVWIRRI 260
>sp|P54137|NTH1_CAEEL Probable endonuclease III homolog OS=Caenorhabditis elegans
GN=nth-1 PE=1 SV=2
Length = 298
Score = 50.1 bits (118), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 11/76 (14%)
Query: 228 KQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
++A Y+ T +L+ SG D+ +++D LC LPGVGPK+A + + +
Sbjct: 122 RKAVYLQKTAKILKDDFSG-----------DIPDSLDGLCALPGVGPKMANLVMQIAWGE 170
Query: 288 HHAIPVDTHVWKIATR 303
I VDTHV +I+ R
Sbjct: 171 CVGIAVDTHVHRISNR 186
>sp|O35980|NTHL1_MOUSE Endonuclease III-like protein 1 OS=Mus musculus GN=Nthl1 PE=2 SV=1
Length = 300
Score = 45.4 bits (106), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 163 LIYPVGFWRNKVKYIKQTTAILQQRYEG-----------DIPASVAELVALPGVGPKMAH 211
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 212 LAMAVAWGTISGIAVDTHVHRIANR 236
>sp|Q2KID2|NTHL1_BOVIN Endonuclease III-like protein 1 OS=Bos taurus GN=NTHL1 PE=2 SV=1
Length = 305
Score = 45.1 bits (105), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 11/86 (12%)
Query: 218 SLLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVA 277
+L++ V + KYI T +LQ ++ G D+ ++ L LPGVGPK+A
Sbjct: 167 ALIYPVGFWRSKVKYIKQTSAILQQRYDG-----------DIPASVAELVALPGVGPKMA 215
Query: 278 ACIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 216 HLAMAVAWGTVSGIAVDTHVHRIANR 241
>sp|P78549|NTHL1_HUMAN Endonuclease III-like protein 1 OS=Homo sapiens GN=NTHL1 PE=1 SV=2
Length = 312
Score = 44.3 bits (103), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 41/85 (48%), Gaps = 11/85 (12%)
Query: 219 LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAA 278
L++ V + KYI T +LQ +H GG D+ ++ L LPGVGPK+A
Sbjct: 175 LIYPVGFWRSKVKYIKQTSAILQ-QHYGG----------DIPASVAELVALPGVGPKMAH 223
Query: 279 CIALFSLDQHHAIPVDTHVWKIATR 303
+ I VDTHV +IA R
Sbjct: 224 LAMAVAWGTVSGIAVDTHVHRIANR 248
>sp|Q9SJQ6|ROS1_ARATH Protein ROS1 OS=Arabidopsis thaliana GN=ROS1 PE=1 SV=2
Length = 1393
Score = 42.4 bits (98), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 42/77 (54%), Gaps = 5/77 (6%)
Query: 228 KQAKYITGTVDVLQSKH-SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLD 286
K A+ I G +D L + H S EWL R + +A + L + G+G K C+ L +L
Sbjct: 908 KLAERIQGFLDRLVNDHGSIDLEWL---RDVPPDKAKEYLLSFNGLGLKSVECVRLLTL- 963
Query: 287 QHHAIPVDTHVWKIATR 303
H A PVDT+V +IA R
Sbjct: 964 HHLAFPVDTNVGRIAVR 980
>sp|O49498|DML3_ARATH DEMETER-like protein 3 OS=Arabidopsis thaliana GN=DML3 PE=2 SV=2
Length = 1044
Score = 40.8 bits (94), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 266 LCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
L + G+G K A C+ L L +HHA PVDT+V +IA R
Sbjct: 584 LLEIEGIGLKSAECVRLLGL-KHHAFPVDTNVGRIAVR 620
>sp|P31378|NTG1_YEAST Mitochondrial DNA base excision repair N-glycosylase 1
OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
GN=NTG1 PE=1 SV=1
Length = 399
Score = 40.0 bits (92), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 50/117 (42%), Gaps = 18/117 (15%)
Query: 181 NVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQAKYITGTVDVL 240
N+ + E S E ++L + +++ L+ SV ++AKYI T +L
Sbjct: 167 NIMRYCIDELHSEEGMTLEAVLQINETKL-------DELIHSVGFHTRKAKYILSTCKIL 219
Query: 241 QSKHSGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHV 297
Q + S D+ I+ L LPGVGPK+A + + I VD HV
Sbjct: 220 QDQFSS-----------DVPATINELLGLPGVGPKMAYLTLQKAWGKIEGICVDVHV 265
>sp|Q9SR66|DML2_ARATH DEMETER-like protein 2 OS=Arabidopsis thaliana GN=DML2 PE=3 SV=2
Length = 1332
Score = 39.3 bits (90), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 230 AKYITGTVDVLQSKH-SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQH 288
A+ I ++ L KH S EWL R + +A + L ++ G+G K C+ L SL Q
Sbjct: 842 AERIKAFLNRLVKKHGSIDLEWL---RDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQ- 897
Query: 289 HAIPVDTHVWKIATR 303
A PVDT+V +IA R
Sbjct: 898 IAFPVDTNVGRIAVR 912
>sp|Q8LK56|DME_ARATH Transcriptional activator DEMETER OS=Arabidopsis thaliana GN=DME PE=1
SV=2
Length = 1987
Score = 37.0 bits (84), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 35/63 (55%), Gaps = 6/63 (9%)
Query: 243 KHSGGA--EWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKI 300
K GG EWL R+ +A D L ++ G+G K C+ L +L + A PVDT+V +I
Sbjct: 1513 KDHGGIDLEWL---RESPPDKAKDYLLSIRGLGLKSVECVRLLTL-HNLAFPVDTNVGRI 1568
Query: 301 ATR 303
A R
Sbjct: 1569 AVR 1571
>sp|Q8SRB8|NTH1_ENCCU Endonuclease III homolog OS=Encephalitozoon cuniculi (strain GB-M1)
GN=NTH1 PE=1 SV=1
Length = 238
Score = 36.6 bits (83), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 34/70 (48%)
Query: 254 LRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATRYLLPELAGVR 313
LR+ L + L +LPG+G K+A + ++ I VDTHV +I+ R L V
Sbjct: 128 LREKGLPREMKDLISLPGIGNKMALLYMSHACNRTVGISVDTHVHRISNRIGLVRTRDVE 187
Query: 314 LTPKLCSRVA 323
T + RV
Sbjct: 188 STRRELERVV 197
>sp|Q09907|NTH1_SCHPO Endonuclease III homolog OS=Schizosaccharomyces pombe (strain 972 /
ATCC 24843) GN=nth1 PE=1 SV=1
Length = 355
Score = 35.0 bits (79), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 258 DLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIAT 302
D+ + ++ L TLPGVGPK+ + ++ I VD HV +I
Sbjct: 125 DIPDTVEDLMTLPGVGPKMGYLCMSIAWNKTVGIGVDVHVHRICN 169
>sp|Q6AB99|RECR_PROAC Recombination protein RecR OS=Propionibacterium acnes (strain
KPA171202 / DSM 16379) GN=recR PE=3 SV=1
Length = 201
Score = 34.7 bits (78), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 18/28 (64%), Positives = 19/28 (67%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
LQE IDAL LPG+GPK A IA LD
Sbjct: 6 LQELIDALSRLPGIGPKGAQRIAFHILD 33
>sp|Q10355|YDB2_SCHPO Uncharacterized RNA-binding protein C22E12.02
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=SPAC22E12.02 PE=4 SV=1
Length = 219
Score = 33.9 bits (76), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/64 (31%), Positives = 33/64 (51%), Gaps = 2/64 (3%)
Query: 171 FLASLGSHLGNVEGFE-FHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQ 229
F+ +LG+ + + ++ F E+PSL + +V + E + GFG+ S S R K
Sbjct: 33 FVGNLGNDVNDESLYQAFSEYPSLVKTKVVRDREGKTRGFGFVSFKDSDQFLKAWRE-KN 91
Query: 230 AKYI 233
KYI
Sbjct: 92 GKYI 95
>sp|Q10630|ALKA_MYCTU Probable bifunctional transcriptional activator/DNA repair enzyme
AlkA OS=Mycobacterium tuberculosis GN=alkA PE=3 SV=1
Length = 496
Score = 33.1 bits (74), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 34/138 (24%), Positives = 53/138 (38%), Gaps = 31/138 (22%)
Query: 171 FLASLGSHLGNVEGFEFHEFPSLERLSLVSEVELRNAGFGYRSAPQSSLLFSVRRSFKQA 230
+A+ G + + G H FPS+E+L+ + G+ + P++ +
Sbjct: 342 LVAAYGRPVHDRHGALTHTFPSIEQLAEIDP--------GHLAVPKA-----------RQ 382
Query: 231 KYITGTVDVLQSKH---SGGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQ 287
+ I V L K G +W Q A L LPGVGP A IA+ L
Sbjct: 383 RTINALVASLADKSLVLDAGCDW---------QRARGQLLALPGVGPWTAEVIAMRGLGD 433
Query: 288 HHAIPVDTHVWKIATRYL 305
A P ++A + L
Sbjct: 434 PDAFPASDLGLRLAAKKL 451
>sp|A0R567|MUTY_MYCS2 A/G-specific adenine glycosylase OS=Mycobacterium smegmatis (strain
ATCC 700084 / mc(2)155) GN=mutY PE=3 SV=1
Length = 293
Score = 32.0 bits (71), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 263 IDALCTLPGVGPKVAACIALFSLDQHHAIPVDTHVWKIATR 303
+D L TLPG+G A +A F+ Q VDT+V ++ TR
Sbjct: 106 VDTLLTLPGIGAYTARAVACFAY-QASVPVVDTNVRRVVTR 145
>sp|B7GFF3|RECR_ANOFW Recombination protein RecR OS=Anoxybacillus flavithermus (strain
DSM 21510 / WK1) GN=recR PE=3 SV=1
Length = 198
Score = 32.0 bits (71), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVD 294
+ + ID+ LPG+GPK A +A F L+ + +D
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLNMKEDVVLD 42
>sp|Q12SM9|ARGB_SHEDO Acetylglutamate kinase OS=Shewanella denitrificans (strain OS217 /
ATCC BAA-1090 / DSM 15013) GN=argB PE=3 SV=1
Length = 260
Score = 31.6 bits (70), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 82/203 (40%), Gaps = 29/203 (14%)
Query: 137 LAGARVLRQDPVECLLQFLCSSNNNIARITKMVDFLASLGSHLGNVEGFEFHEFPSLERL 196
L G RV +D + ++ L ++N I + L S+G LG +G H ERL
Sbjct: 64 LDGLRVTPEDQIPIVVGALAGTSNKILQAAAAKAGLVSVGMSLG--DGNTVHAKIKDERL 121
Query: 197 SLVSEVELRNAGF-------GYR------SAPQSSLLFSV---RRSFKQAKYITGTVDVL 240
LV EV +A + G+ + L+ +V + + AK + G + VL
Sbjct: 122 GLVGEVSPNDATYLNFILDQGWLPICSSIAVSADGLMLNVNADQAATALAKLVNGNL-VL 180
Query: 241 QSKHS----GGAEWLLSLRKLDLQEAIDALCTLPGVGPKVAACIALFSLDQHHAIPVDTH 296
S S G + + SL K +++ + G+ KV A + Q PV
Sbjct: 181 LSDVSGVLDGKGQLIASLNKTEIETLVKQGVIEKGMKVKVEAA---LEVAQWMGKPVQVA 237
Query: 297 VWKIA---TRYLLPELAGVRLTP 316
W+ A + +L + G ++ P
Sbjct: 238 SWRDAEQLKKLVLGQSVGTQIQP 260
>sp|Q4JSL5|RECR_CORJK Recombination protein RecR OS=Corynebacterium jeikeium (strain
K411) GN=recR PE=3 SV=1
Length = 250
Score = 31.6 bits (70), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 15/30 (50%), Positives = 19/30 (63%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLDQH 288
LQ+ ID LPG+GPK A IAL L++
Sbjct: 6 LQDVIDEFSRLPGIGPKSAQRIALHLLNEE 35
>sp|A7GJT7|RECR_BACCN Recombination protein RecR OS=Bacillus cereus subsp. cytotoxis
(strain NVH 391-98) GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|Q6HPZ0|RECR_BACHK Recombination protein RecR OS=Bacillus thuringiensis subsp.
konkukian (strain 97-27) GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|Q63HE8|RECR_BACCZ Recombination protein RecR OS=Bacillus cereus (strain ZK / E33L)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|Q81JB9|RECR_BACCR Recombination protein RecR OS=Bacillus cereus (strain ATCC 14579 /
DSM 31) GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B9IYI9|RECR_BACCQ Recombination protein RecR OS=Bacillus cereus (strain Q1) GN=recR
PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B7HPT6|RECR_BACC7 Recombination protein RecR OS=Bacillus cereus (strain AH187)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B7HIJ3|RECR_BACC4 Recombination protein RecR OS=Bacillus cereus (strain B4264)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|C1ES27|RECR_BACC3 Recombination protein RecR OS=Bacillus cereus (strain 03BB102)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B7IS39|RECR_BACC2 Recombination protein RecR OS=Bacillus cereus (strain G9842)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|Q73FI4|RECR_BACC1 Recombination protein RecR OS=Bacillus cereus (strain ATCC 10987)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B7JJD6|RECR_BACC0 Recombination protein RecR OS=Bacillus cereus (strain AH820)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|A0R899|RECR_BACAH Recombination protein RecR OS=Bacillus thuringiensis (strain Al
Hakam) GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|B7KW75|ARLY_METC4 Argininosuccinate lyase OS=Methylobacterium chloromethanicum
(strain CM4 / NCIMB 13688) GN=argH PE=3 SV=1
Length = 463
Score = 31.6 bits (70), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 43/84 (51%), Gaps = 5/84 (5%)
Query: 199 VSEVELRNAGFGYRSAPQSS--LLFSVRRSFKQAKYITGTVDVLQSKHSGGAEWLLSLRK 256
V+E R AG GY +A + L+ + F+QA ++TG V S+ G E LSL+
Sbjct: 353 VAETLKRAAGSGYATATDLADWLVRELNMPFRQAHHVTGRVVAAASERGIGLEE-LSLQD 411
Query: 257 LDLQEA--IDALCTLPGVGPKVAA 278
+ EA DA+ + GV VA+
Sbjct: 412 MQAVEAGITDAVFAVLGVENSVAS 435
>sp|Q81W16|RECR_BACAN Recombination protein RecR OS=Bacillus anthracis GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|C3LIZ7|RECR_BACAC Recombination protein RecR OS=Bacillus anthracis (strain CDC 684 /
NRRL 3495) GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|C3P9H0|RECR_BACAA Recombination protein RecR OS=Bacillus anthracis (strain A0248)
GN=recR PE=3 SV=1
Length = 198
Score = 31.6 bits (70), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
>sp|A9VN38|RECR_BACWK Recombination protein RecR OS=Bacillus weihenstephanensis (strain
KBAB4) GN=recR PE=3 SV=1
Length = 198
Score = 31.2 bits (69), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 259 LQEAIDALCTLPGVGPKVAACIALFSLD 286
+ + ID+ LPG+GPK A +A F LD
Sbjct: 7 ISKLIDSFMKLPGIGPKTAVRLAFFVLD 34
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.135 0.415
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 126,950,219
Number of Sequences: 539616
Number of extensions: 5325220
Number of successful extensions: 29971
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 54
Number of HSP's successfully gapped in prelim test: 71
Number of HSP's that attempted gapping in prelim test: 29046
Number of HSP's gapped (non-prelim): 824
length of query: 328
length of database: 191,569,459
effective HSP length: 118
effective length of query: 210
effective length of database: 127,894,771
effective search space: 26857901910
effective search space used: 26857901910
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (28.1 bits)