BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 001299
(1105 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q8LK56|DME_ARATH Transcriptional activator DEMETER OS=Arabidopsis thaliana GN=DME PE=1
SV=2
Length = 1987
Score = 628 bits (1620), Expect = e-179, Method: Compositional matrix adjust.
Identities = 307/534 (57%), Positives = 382/534 (71%), Gaps = 10/534 (1%)
Query: 568 WDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR+ E ER+ + MDS+D+EA+R A SEI++AIKERG N++A RIK+FL
Sbjct: 1450 WDSLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLE 1509
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R+V+ HG IDLEWLR PPDK K+YLL I GLGLKSVECVRLL+L ++AFPVD NVGRIA
Sbjct: 1510 RIVKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIA 1569
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
VR+GWVPLQPLP L +HLL+ YPV++ IQ +LWPRLC LDQ+TLYELHYQ+ITFGK+FC
Sbjct: 1570 VRMGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFC 1629
Query: 745 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV 804
K PNC ACP+RGEC+HFASA ASAR ALP P E+ + ++ PP+ + I +
Sbjct: 1630 TKSRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIEL 1689
Query: 805 -IRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
+ +E S + NCEPIIEEP SPG +C+E ES+I+D ++ D +EIPT++L
Sbjct: 1690 PLPLEKSLASGAPSNRENCEPIIEEPASPGQECTEITESDIEDAYYN-EDPDEIPTIKLN 1748
Query: 864 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
+F ME N + LQ+ S ALVAL S+PT KLK + LRTEH VY LPD
Sbjct: 1749 IEQFGMTLREHMERN-MELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPD 1807
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 979
+H LL G D+R+PDDPSPYLLAIW+PGE+ NS P+ +C K S K+C E C CNS+
Sbjct: 1808 SHRLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSL 1867
Query: 980 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
E N+ VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S PI+VPR I +L R
Sbjct: 1868 REANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRR 1927
Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
V FG+S T++F L QIQ FW+GFVCVRGF+++ R PRPL+ RLH SK
Sbjct: 1928 TVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASK 1981
Score = 137 bits (344), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 122/194 (62%), Gaps = 20/194 (10%)
Query: 76 LMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQ 135
L Q + A+II + ++L + D + NA+V Y+ GALVP++
Sbjct: 904 LCQDSIAEIIYRMQNLYLGDKERE------------QEQNAMVLYK-----GDGALVPYE 946
Query: 136 IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSF 195
K++K +P+V +D T R+WNL+M D D+ ++ EKWW++ER VF GR SF
Sbjct: 947 --SKKRKPRPKVDIDDETTRIWNLLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSF 1004
Query: 196 TARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPS 255
ARMHL+ GDRRF PWKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAARFP K ++
Sbjct: 1005 IARMHLVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSRE-D 1063
Query: 256 DENLRTTASLEPIG 269
+ N+R+ +P G
Sbjct: 1064 ERNVRSVVVEDPEG 1077
>sp|Q9SJQ6|ROS1_ARATH Protein ROS1 OS=Arabidopsis thaliana GN=ROS1 PE=1 SV=2
Length = 1393
Score = 622 bits (1605), Expect = e-177, Method: Compositional matrix adjust.
Identities = 312/538 (57%), Positives = 374/538 (69%), Gaps = 22/538 (4%)
Query: 567 DWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LRR + E++ MD+VDW+A+R AD E+A+ IK RG + +A RI+ FL
Sbjct: 858 DWDCLRREAQARAGIREKTRSTMDTVDWKAIRAADVKEVAETIKSRGMNHKLAERIQGFL 917
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
+RLV HGSIDLEWLR+VPPDK KEYLL GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 918 DRLVNDHGSIDLEWLRDVPPDKAKEYLLSFNGLGLKSVECVRLLTLHHLAFPVDTNVGRI 977
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
AVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQKTLYELHYQMITFGK+F
Sbjct: 978 AVRLGWVPLQPLPESLQLHLLEMYPMLESIQKYLWPRLCKLDQKTLYELHYQMITFGKVF 1037
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVN-PI 802
C K PNC ACP++GEC+HFASA ASAR ALP ++E G G PL ++ P
Sbjct: 1038 CTKSKPNCNACPMKGECRHFASAFASARLALP--------STEKGMGTPDKNPLPLHLPE 1089
Query: 803 PVIRIEADPV---SESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPT 859
P R + V SE ++ CEPIIEEP SP P+ +E ++I++ F D EEIPT
Sbjct: 1090 PFQREQGSEVVQHSEPAKKVTCCEPIIEEPASPEPETAEVSIADIEEAFFE--DPEEIPT 1147
Query: 860 LRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVY 916
+RL F N ME NK LQD SSALVALTAE AS+P KLK + LRTEH VY
Sbjct: 1148 IRLNMDAFTSNLKKIMEHNK-ELQDGNMSSALVALTAETASLPMPKLKNISQLRTEHRVY 1206
Query: 917 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRC-NSKDSKLCNSEICYS 975
LPD H LL ++R+PDDP YLLAIW+PGE+ +S+ C + LC+ E C+S
Sbjct: 1207 ELPDEHPLLAQLEKREPDDPCSYLLAIWTPGETADSIQPSVSTCIFQANGMLCDEETCFS 1266
Query: 976 CNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIAN 1035
CNS+ E + IVRGTILIPCRTA RG FPLNGTYFQVNEVFADH +S +PINVPR I
Sbjct: 1267 CNSIKETRSQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHASSLNPINVPRELIWE 1326
Query: 1036 LRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
L R V FG+S T+F L +IQ FW+G+VCVRGFDR+ R P+PL+ RLH SK
Sbjct: 1327 LPRRTVYFGTSVPTIFKGLSTEKIQACFWKGYVCVRGFDRKTRGPKPLIARLHFPASK 1384
Score = 114 bits (284), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 72/136 (52%), Positives = 93/136 (68%), Gaps = 7/136 (5%)
Query: 116 AVVPYQVGP-----SSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKD 170
A+VPY + GA+VP K+ + +P+V LD T R+W L++ + D
Sbjct: 494 ALVPYTMNSQIVLFGGGAGAIVP-VTPVKKPRPRPKVDLDDETDRVWKLLLENINSEGVD 552
Query: 171 QTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSD 230
S+E KWW++ER VF GR SF ARMHL+ GDRRF PWKGSVVDSVVGV+LTQNVSD
Sbjct: 553 -GSDEQKAKWWEEERNVFRGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVFLTQNVSD 611
Query: 231 NLSSSAYMSLAARFPL 246
+LSSSA+MSLA++FP+
Sbjct: 612 HLSSSAFMSLASQFPV 627
>sp|Q9SR66|DML2_ARATH DEMETER-like protein 2 OS=Arabidopsis thaliana GN=DML2 PE=3 SV=2
Length = 1332
Score = 580 bits (1496), Expect = e-164, Method: Compositional matrix adjust.
Identities = 323/717 (45%), Positives = 428/717 (59%), Gaps = 39/717 (5%)
Query: 404 ITQSSVTQFWP-TGNSTADVASPSKT---------------CIKESSIAASTEIPQLENT 447
+TQ ++ P TG ST + +P++ C +ESS + + I + +
Sbjct: 613 VTQETILNLDPRTGVSTPRIRNPTRVIIEEIDDDENDIDAVCSQESSKTSDSSITSADQS 672
Query: 448 -ALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEIS 506
+L D + +L ++ K G+ T NDL +SS+ + ++
Sbjct: 673 KTMLLDPFNTVLMNEQVDSQMVK----GKGHIPYTDDLNDLSQGISMVSSASTHCELNLN 728
Query: 507 VLPSNRESFRTGM-PQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQ 565
+P E P++ T ++S R + K K + KK+ S
Sbjct: 729 EVPPEVELCSHQQDPESTIQTQDQQESTRTEDVKKNRKKPTTSKPKKKSKESAKSTQKKS 788
Query: 566 QDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEF 622
DWD LR+ +G ER+ MD+VDW+A+RC D +IA+ I +RG N++A RIK F
Sbjct: 789 VDWDSLRKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAF 848
Query: 623 LNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGR 682
LNRLV+ HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECVRLLSL IAFPVD NVGR
Sbjct: 849 LNRLVKKHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGR 908
Query: 683 IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 742
IAVRLGWVPLQPLP +L +HLL+ YPV++ +Q YLWPRLC LDQKTLYELHY MITFGK+
Sbjct: 909 IAVRLGWVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMITFGKV 968
Query: 743 FCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPI 802
FC K PNC ACP++ EC+H++SA ASAR ALP P E + + P+VVN
Sbjct: 969 FCTKVKPNCNACPMKAECRHYSSARASARLALPEPEESDRTSVMIHERRSKRKPVVVNFR 1028
Query: 803 PVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEE------ 856
P + + + E+ + NCEPIIEEP SP P+ E + D ++G E+
Sbjct: 1029 PSLFLYQEKEQEA-QRSQNCEPIIEEPASPEPEYIEHDIEDYPRDKNNVGTSEDPWENKD 1087
Query: 857 -IPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHV 915
IPT+ L + + + + +NK S LV L+ AA++P RKLK LRTEHHV
Sbjct: 1088 VIPTIILNK---EAGTSHDLVVNKEA-GTSHDLVVLSTYAAAIPRRKLKIKEKLRTEHHV 1143
Query: 916 YVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCN--SKDSKLCNSEIC 973
+ LPD H +L GF+RR+ +D PYLLAIW+PGE+ NS+ P+ RC ++ LCN C
Sbjct: 1144 FELPDHHSILEGFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCALFESNNTLCNENKC 1203
Query: 974 YSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTI 1033
+ CN E + VRGTILIPCRTA RG FPLNGTYFQ NEVFADH++S +PI+VP I
Sbjct: 1204 FQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHDSSINPIDVPTELI 1263
Query: 1034 ANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCS 1090
+L+R + GSS +++ L + I+ F G+VCVRGFDR R P+ LV RLHCS
Sbjct: 1264 WDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRKPKSLVKRLHCS 1320
Score = 91.3 bits (225), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 80/171 (46%), Positives = 105/171 (61%), Gaps = 17/171 (9%)
Query: 78 QSTTADIIRK-FKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQI 136
+S T D I K F++L I G LP N A++ Y+ E A+V +
Sbjct: 443 RSLTIDAITKLFEELDINKEGLCLP---------HNRETALILYKKS-YEEQKAIVKYS- 491
Query: 137 KEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFT 196
KK KP+V LDP T R+W L+M+ D D S+E+ KWW++ER +F GR SF
Sbjct: 492 ----KKQKPKVQLDPETSRVWKLLMSSIDCDGVD-GSDEEKRKWWEEERNMFHGRANSFI 546
Query: 197 ARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLK 247
ARM ++ G+R F PWKGSVVDSVVGV+LTQNV+D+ SSSAYM LAA FP++
Sbjct: 547 ARMRVVQGNRTFSPWKGSVVDSVVGVFLTQNVADHSSSSAYMDLAAEFPVE 597
>sp|O49498|DML3_ARATH DEMETER-like protein 3 OS=Arabidopsis thaliana GN=DML3 PE=2 SV=2
Length = 1044
Score = 509 bits (1312), Expect = e-143, Method: Compositional matrix adjust.
Identities = 268/538 (49%), Positives = 342/538 (63%), Gaps = 14/538 (2%)
Query: 567 DWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL 626
DW+ LRR+Y+ R MDSV+W VR + ++ + IK+RGQ I++ RI +FLN
Sbjct: 501 DWNNLRRMYTKEGSRPEMHMDSVNWSDVRLSGQNVLETTIKKRGQFRILSERILKFLNDE 560
Query: 627 VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 686
V +G+IDLEWLRN P VK YLL+IEG+GLKS ECVRLL L+H AFPVD NVGRIAVR
Sbjct: 561 VNQNGNIDLEWLRNAPSHLVKRYLLEIEGIGLKSAECVRLLGLKHHAFPVDTNVGRIAVR 620
Query: 687 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 746
LG VPL+PLP + +H L EYP MD IQ YLWPRLC L Q+TLYELHYQMITFGK+FC K
Sbjct: 621 LGLVPLEPLPNGVQMHQLFEYPSMDSIQKYLWPRLCKLPQETLYELHYQMITFGKVFCTK 680
Query: 747 RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIR 806
PNC ACP++ ECK+FASA S++ L P EK + F N Q + V+ I
Sbjct: 681 TIPNCNACPMKSECKYFASAYVSSKVLLESPEEKMHEPNTFMNAHSQD--VAVDMTSNIN 738
Query: 807 IEADPVSESGYQINNC-EPIIEEPRSPGPQCSESPESEIDDDDFS--IGDIEEIPTLRLQ 863
+ + VS C +P++E P SP + ES ++I+D F +P +
Sbjct: 739 LVEECVSSGCSDQAICYKPLVEFPSSPRAEIPES--TDIEDVPFMNLYQSYASVPKIDFD 796
Query: 864 EREFKENFPNFMEMNKVMLQD----SSALVALTAEAASVPT---RKLKRCAHLRTEHHVY 916
K++ + + ++ M S ALV T E A +P RK+K LRTEH VY
Sbjct: 797 LDALKKSVEDALVISGRMSSSDEEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVY 856
Query: 917 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSC 976
VLPD HELL F+RR DDPSPYLLAIW PGE+ +S P+ +C+S SKLC + C C
Sbjct: 857 VLPDNHELLHDFERRKLDDPSPYLLAIWQPGETSSSFVPPKKKCSSDGSKLCKIKNCSYC 916
Query: 977 NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANL 1036
++ E+N+ I RGTILIPCRTA RG FPLNGTYFQ NEVFADHETS +PI R L
Sbjct: 917 WTIREQNSNIFRGTILIPCRTAMRGAFPLNGTYFQTNEVFADHETSLNPIVFRRELCKGL 976
Query: 1037 RREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKK 1094
+ + GS+ T++F LD R+I+ FW GF+C+R FDR+ R P+ LV RLH P ++
Sbjct: 977 EKRALYCGSTVTSIFKLLDTRRIELCFWTGFLCLRAFDRKQRDPKELVRRLHTPPDER 1034
Score = 94.0 bits (232), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 82/115 (71%), Gaps = 2/115 (1%)
Query: 132 VPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGR 191
V IK +K + +V LDP T++ W+++M ++D ++ E E W+KERE+F+ R
Sbjct: 334 VTTMIKADKKLVTAKVNLDPETIKEWDVLM-VNDSPSRSYDDKE-TEAKWKKEREIFQTR 391
Query: 192 IQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPL 246
I F RMH + G+R+FK WKGSVVDSVVGV+LTQN +D LSS+A+MS+AA+FP+
Sbjct: 392 IDLFINRMHRLQGNRKFKQWKGSVVDSVVGVFLTQNTTDYLSSNAFMSVAAKFPV 446
>sp|Q9WYK0|END3_THEMA Endonuclease III OS=Thermotoga maritima (strain ATCC 43589 / MSB8 /
DSM 3109 / JCM 10099) GN=nth PE=3 SV=1
Length = 213
Score = 57.0 bits (136), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 73/167 (43%), Gaps = 27/167 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A ++ D IKE G A RI E LVE +G PD ++E LL + G+
Sbjct: 60 AKPEDLYDLIKESGMYRQKAERIVEISRILVEKYGG--------RVPDSLEE-LLKLPGV 110
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ V + + A VD +V RI+ RLGWV + P + L K P
Sbjct: 111 GRKTANIVLWVGFKKPALAVDTHVHRISNRLGWVKTRT-PEETEEALKKLLP------ED 163
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
LW ++ M+ FG+ CK ++P C C L+ C+ +
Sbjct: 164 LWG-----------PINGSMVEFGRRICKPQNPLCEECFLKNHCEFY 199
>sp|O05956|END3_RICPR Endonuclease III OS=Rickettsia prowazekii (strain Madrid E) GN=nth
PE=3 SV=1
Length = 212
Score = 47.0 bits (110), Expect = 8e-04, Method: Composition-based stats.
Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 19/127 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P+ KE L+ + G+G K+ V VD +V R++ R+G G+
Sbjct: 105 PNNFKE-LVKLPGVGRKTANVVLNCLFAMPTMAVDTHVFRVSKRIGLAK-----GNTAAI 158
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ KE L +D+K L H+ +I G+ CK R P C CP++ C++
Sbjct: 159 VEKEL-------------LQIIDEKWLTYAHHWLILHGRYICKARKPGCNICPIKEYCEY 205
Query: 763 FASAVAS 769
+ + +S
Sbjct: 206 YINTFSS 212
>sp|P39788|END3_BACSU Probable endonuclease III OS=Bacillus subtilis (strain 168) GN=nth
PE=3 SV=1
Length = 219
Score = 47.0 bits (110), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ IK G A I++ ++E +G VP D+ + L+ + G+G K+
Sbjct: 72 ELQQDIKSIGLYRNKAKNIQKLSKMIIEDYGG-------EVPRDR--DELVKLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+++ P D W
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMRKVPKED------WS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK +SP C CPL C+
Sbjct: 176 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLSLCR 206
>sp|Q58829|Y1434_METJA Putative endonuclease MJ1434 OS=Methanocaldococcus jannaschii
(strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC
100440) GN=MJ1434 PE=3 SV=1
Length = 220
Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats.
Identities = 42/165 (25%), Positives = 74/165 (44%), Gaps = 20/165 (12%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV--KEYLLDIEG 655
DE ++ + I+ G N+ A R+K +VE +G + E + D + + LL I G
Sbjct: 71 DEDKLKELIRPAGFYNLKAKRLKNVTKFIVENYG--NTEEMAKTDKDTLILRAELLSING 128
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
+G ++ + + L +L +F VD R+ RLG ++ E D+I+
Sbjct: 129 VGKETADSILLYALDRESFVVDAYTKRMFSRLG--------------VINEKAKYDEIKE 174
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L D + E H ++ K FC+K++ C CP++ C
Sbjct: 175 IFEKNLPK-DLEIYKEYHALIVEHCKKFCRKKAL-CDNCPIKEFC 217
>sp|Q68W04|END3_RICTY Endonuclease III OS=Rickettsia typhi (strain ATCC VR-144 /
Wilmington) GN=nth PE=3 SV=1
Length = 212
Score = 45.4 bits (106), Expect = 0.003, Method: Composition-based stats.
Identities = 32/127 (25%), Positives = 57/127 (44%), Gaps = 19/127 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P+ KE L+ + G+G K+ V VD +V R++ R+G G+ +
Sbjct: 105 PNDFKE-LVKLPGVGRKTANVVLNCLFAMPTMAVDTHVFRVSKRIGLAK-----GNTTVI 158
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ KE L +D+K L H+ ++ G+ CK R P+C C ++ C++
Sbjct: 159 VEKEL-------------LQIIDEKWLTHAHHWLVLHGRYICKARKPSCRICHIKEYCEY 205
Query: 763 FASAVAS 769
+ + AS
Sbjct: 206 YINTFAS 212
>sp|Q4UK93|END3_RICFE Endonuclease III OS=Rickettsia felis (strain ATCC VR-1525 /
URRWXCal2) GN=nth PE=3 SV=1
Length = 213
Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 19/127 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P+ KE L+ + G+G K+ V VD +V R+A R+G P +
Sbjct: 105 PNDFKE-LIKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVAKRIGLARGNS-PEIVEKE 162
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LL+ +++K L H+ +I G+ CK R P+C CP++ C++
Sbjct: 163 LLQ-----------------IINEKWLTHAHHWLILHGRYICKARKPDCDICPIKEYCEY 205
Query: 763 FASAVAS 769
+ S + S
Sbjct: 206 YNSPIIS 212
>sp|Q92GH4|END3_RICCN Endonuclease III OS=Rickettsia conorii (strain ATCC VR-613 / Malish
7) GN=nth PE=3 SV=1
Length = 210
Score = 42.7 bits (99), Expect = 0.016, Method: Composition-based stats.
Identities = 30/114 (26%), Positives = 50/114 (43%), Gaps = 18/114 (15%)
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L+ + G+G K+ V VD +V R+A R+G GD P
Sbjct: 111 LIKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVANRIGLAK-----GDT--------PE 157
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+ + ++ L +D K L H+ +I G+ CK R P+C CP++ C ++
Sbjct: 158 IVENEL-----LQIIDTKWLTHAHHWLILHGRYICKARKPDCDICPIKEYCDYY 206
>sp|Q9V3I8|OGG1_DROME N-glycosylase/DNA lyase OS=Drosophila melanogaster GN=Ogg1 PE=2
SV=2
Length = 343
Score = 41.6 bits (96), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 34/48 (70%), Gaps = 1/48 (2%)
Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRIA 684
L+++P +K +E L + G+G K +C+ L+S+ H+ + PVD+++ RIA
Sbjct: 229 LKSMPFEKAREELTLLPGIGYKVADCICLMSMGHLESVPVDIHIYRIA 276
>sp|A9M2W6|PYRC_NEIM0 Dihydroorotase OS=Neisseria meningitidis serogroup C (strain
053442) GN=pyrC PE=3 SV=1
Length = 344
Score = 40.4 bits (93), Expect = 0.074, Method: Composition-based stats.
Identities = 30/119 (25%), Positives = 53/119 (44%), Gaps = 7/119 (5%)
Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
+GS + ++ +YLT N + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDNATSELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|Q9K0D1|PYRC_NEIMB Dihydroorotase OS=Neisseria meningitidis serogroup B (strain MC58)
GN=pyrC PE=3 SV=1
Length = 344
Score = 40.0 bits (92), Expect = 0.12, Method: Composition-based stats.
Identities = 30/119 (25%), Positives = 53/119 (44%), Gaps = 7/119 (5%)
Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
+GS + ++ +YLT N + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDNATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|O08760|OGG1_MOUSE N-glycosylase/DNA lyase OS=Mus musculus GN=Ogg1 PE=2 SV=2
Length = 345
Score = 39.7 bits (91), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIA 684
L E G L+ LR P ++ + L + G+G K +C+ L++L + A PVDV+V +IA
Sbjct: 216 LEEQGGPAWLQQLRVAPYEEAHKALCTLPGVGAKVADCICLMALDKPQAVPVDVHVWQIA 275
Query: 685 VR-LGWVP 691
R GW P
Sbjct: 276 HRDYGWHP 283
>sp|P29588|GTMR_METTF G/T mismatches repair enzyme OS=Methanobacterium thermoformicicum
PE=1 SV=1
Length = 221
Score = 38.9 bits (89), Expect = 0.22, Method: Composition-based stats.
Identities = 45/197 (22%), Positives = 78/197 (39%), Gaps = 27/197 (13%)
Query: 570 LLRRIYSTGEERSHDK--MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV 627
LLRR + ++ +DK + +E + +SEIA IKE G N A ++KE ++
Sbjct: 44 LLRRTTAGHVKKIYDKFFVKYKCFEDILKTPKSEIAKDIKEIGLSNQRAEQLKELARVVI 103
Query: 628 ELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRL 687
+G VP + ++ +LD+ G+G + V L+ A VD N R+ R
Sbjct: 104 NDYGG-------RVP--RNRKAILDLPGVGKYTCAAVMCLAFGKKAAMVDANFVRVINR- 153
Query: 688 GWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL-CYLDQKTLYELHYQMITFGKIFCKK 746
+ Y ++ LW + + + ++ F I C
Sbjct: 154 --------------YFGGSYENLNYNHKALWELAETLVPGGKCRDFNLGLMDFSAIICAP 199
Query: 747 RSPNCGACPLRGECKHF 763
R P C C + C ++
Sbjct: 200 RKPKCEKCGMSKLCSYY 216
>sp|P46303|UVEN_MICLC Ultraviolet N-glycosylase/AP lyase OS=Micrococcus luteus (strain
ATCC 4698 / DSM 20030 / JCM 1464 / NBRC 3333 / NCIMB
9278 / NCTC 2665 / VKM Ac-2230) GN=pdg PE=1 SV=2
Length = 279
Score = 37.7 bits (86), Expect = 0.54, Method: Composition-based stats.
Identities = 42/178 (23%), Positives = 69/178 (38%), Gaps = 28/178 (15%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKE 648
D A+ A E E+ + ++ G A I LV H G + P ++++
Sbjct: 75 DAHAMAAATEPELQELVRSTGFYRNKASAILRLSQELVGRHDGEV---------PARLED 125
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L+ + G+G K+ V + VD + GR+A RLG+ + PG + P
Sbjct: 126 -LVALPGVGRKTAFVVLGNAFGQPGITVDTHFGRLARRLGFTD-ETDPGKGRARRGRPVP 183
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
M L +++I G+ C R P CG CP+ C +A+
Sbjct: 184 PARDWTM----------------LSHRLIFHGRRVCHARRPACGRCPIARWCPSYAAG 225
>sp|Q5F9Y1|PYRC_NEIG1 Dihydroorotase OS=Neisseria gonorrhoeae (strain ATCC 700825 / FA
1090) GN=pyrC PE=3 SV=1
Length = 344
Score = 37.7 bits (86), Expect = 0.60, Method: Composition-based stats.
Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)
Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|B4RJT2|PYRC_NEIG2 Dihydroorotase OS=Neisseria gonorrhoeae (strain NCCP11945) GN=pyrC
PE=3 SV=1
Length = 344
Score = 37.4 bits (85), Expect = 0.60, Method: Composition-based stats.
Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)
Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|A0R6D9|MAK_MYCS2 Maltokinase OS=Mycobacterium smegmatis (strain ATCC 700084 /
mc(2)155) GN=mak PE=3 SV=1
Length = 441
Score = 37.4 bits (85), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 10/95 (10%)
Query: 74 QQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVP 133
++L+ +TTA +R +RDG + LQ N ADGP + +V GP E+ +
Sbjct: 20 RELVSATTAMAVR------LRDGLELVLLQANYADGPDERYQVIVATGSGPIDEYSVVAT 73
Query: 134 HQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTT 168
I + + + + DP R L+ ID+ T
Sbjct: 74 IGIADGQTAY--DALYDPDATRY--LLSLIDESAT 104
>sp|Q9JVD6|PYRC_NEIMA Dihydroorotase OS=Neisseria meningitidis serogroup A / serotype 4A
(strain Z2491) GN=pyrC PE=3 SV=1
Length = 344
Score = 37.0 bits (84), Expect = 0.83, Method: Composition-based stats.
Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)
Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPALVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|Q9CB92|END3_MYCLE Endonuclease III OS=Mycobacterium leprae (strain TN) GN=nth PE=3
SV=2
Length = 245
Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats.
Identities = 47/179 (26%), Positives = 68/179 (37%), Gaps = 27/179 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD +E+ + I+ G A + LVE D E VP V L + G+
Sbjct: 78 ADRAELENFIRPTGFFRNKAASLIRLGQALVE---RFDGE----VPSTMVD--LFTLPGV 128
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ + + VD + GR+ R W +E PV K++
Sbjct: 129 GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTA-------------EEDPV--KVEHA 173
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 775
+ L DQ TL L +++I G+ C R P CG C L +C F A P
Sbjct: 174 V-GELIERDQWTL--LSHRVIFHGRRVCHARKPACGVCVLAKDCPSFGLGPTEPLLAAP 229
>sp|P53397|OGG1_YEAST N-glycosylase/DNA lyase OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=OGG1 PE=1 SV=1
Length = 376
Score = 36.6 bits (83), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 40/72 (55%), Gaps = 5/72 (6%)
Query: 620 KEFLNRLVELHGSIDLEWLRNVPPDK----VKEYLLDIEGLGLKSVECVRLLSLQHIAF- 674
++ +N E + + D +L+++ D V+E+L+ G+G K +CV L+ L
Sbjct: 198 RKLVNDKAEANITSDTTYLQSICKDAQYEDVREHLMSYNGVGPKVADCVCLMGLHMDGIV 257
Query: 675 PVDVNVGRIAVR 686
PVDV+V RIA R
Sbjct: 258 PVDVHVSRIAKR 269
>sp|Q08214|NTG2_YEAST DNA base excision repair N-glycosylase 2 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=NTG2 PE=1
SV=1
Length = 380
Score = 36.6 bits (83), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 53/119 (44%), Gaps = 30/119 (25%)
Query: 680 VGRIAVRLGWVPLQPLPGDLHIH-LLKEYPVMDKI----------QMYLWPRLCYLDQKT 728
+G + ++ GW + + D+H+H L K + +D I ++ +W L
Sbjct: 249 MGYLTLQKGWGLIAGICVDVHVHRLCKMWNWVDPIKCKTAEHTRKELQVW-----LPHSL 303
Query: 729 LYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEF 787
YE++ ++ FG++ C R C C A+ V +AR +EK I +S+F
Sbjct: 304 WYEINTVLVGFGQLICMARGKRCDLC--------LANDVCNAR------NEKLIESSKF 348
>sp|O70249|OGG1_RAT N-glycosylase/DNA lyase OS=Rattus norvegicus GN=Ogg1 PE=2 SV=1
Length = 345
Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIA 684
L E G L+ LR ++ + L + G+G K +C+ L++L + A PVD++V +IA
Sbjct: 216 LEEQGGPAWLQQLRVASYEEAHKALCTLPGVGTKVADCICLMALDKPQAVPVDIHVWQIA 275
Query: 685 VR-LGWVP 691
R GW P
Sbjct: 276 HRDYGWQP 283
>sp|O27397|OGG1_METTH Probable N-glycosylase/DNA lyase OS=Methanothermobacter
thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM
10044 / NBRC 100330 / Delta H) GN=MTH_1342 PE=3 SV=1
Length = 312
Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 629 LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRI 683
L +D+ + + D +E LL++ G+G K +C+ L + AFPVDV + RI
Sbjct: 205 LAEEMDIRRIDGMDYDDARELLLELSGVGPKVADCILLYGFRKTEAFPVDVWIRRI 260
>sp|Q9FNY7|OGG1_ARATH N-glycosylase/DNA lyase OGG1 OS=Arabidopsis thaliana GN=OGG1 PE=1
SV=1
Length = 365
Score = 35.4 bits (80), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 636 EWL---RNVPPDKVKEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIA 684
EWL R V + L + G+G K C+ L SL QH A PVD +V +IA
Sbjct: 235 EWLLSLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDTHVWQIA 287
>sp|P63540|END3_MYCTU Endonuclease III OS=Mycobacterium tuberculosis GN=nth PE=3 SV=1
Length = 245
Score = 35.0 bits (79), Expect = 3.1, Method: Composition-based stats.
Identities = 42/179 (23%), Positives = 65/179 (36%), Gaps = 27/179 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD +E+ I+ G A + LVE G VP K L+ + G+
Sbjct: 78 ADRTELESLIRPTGFYRNKAASLIGLGQALVERFGG-------EVPATMDK--LVTLPGV 128
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ + + VD + GR+ R W E PV K++
Sbjct: 129 GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTT-------------AEDPV--KVEQA 173
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 775
+ +++K L +++I G+ C R P CG C L +C F A P
Sbjct: 174 VGE---LIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAP 229
>sp|P63541|END3_MYCBO Endonuclease III OS=Mycobacterium bovis (strain ATCC BAA-935 /
AF2122/97) GN=nth PE=3 SV=1
Length = 245
Score = 35.0 bits (79), Expect = 3.1, Method: Composition-based stats.
Identities = 42/179 (23%), Positives = 65/179 (36%), Gaps = 27/179 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD +E+ I+ G A + LVE G VP K L+ + G+
Sbjct: 78 ADRTELESLIRPTGFYRNKAASLIGLGQALVERFGG-------EVPATMDK--LVTLPGV 128
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ + + VD + GR+ R W E PV K++
Sbjct: 129 GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTT-------------AEDPV--KVEQA 173
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 775
+ +++K L +++I G+ C R P CG C L +C F A P
Sbjct: 174 VGE---LIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAP 229
>sp|P34648|YOT2_CAEEL Uncharacterized protein ZK632.2 OS=Caenorhabditis elegans
GN=ZK632.2 PE=4 SV=1
Length = 710
Score = 35.0 bits (79), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 42/199 (21%), Positives = 81/199 (40%), Gaps = 29/199 (14%)
Query: 4 LDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLWFTAKSRGSQKIIQVERNGLPTQKMPPR 63
LD +Q+ + +++ W K + T +S+ +K++++ + P
Sbjct: 454 LDDYIRQLEKSGGAGDDAKTKMEKSKWRQKLMAATHESQKLEKLVKIAK---------PA 504
Query: 64 YSRGLQVDMEQQLMQSTTAD---IIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPY 120
+GL+ QL + D ++K + R Q P Q GP +A +P
Sbjct: 505 VVKGLE-----QLETTAANDRQAFLKKLMGVRARKEIDQTPSQ-----GPGPSTSATLPA 554
Query: 121 QVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDD-GTTKDQTSNEDMEK 179
V P+S V H+ K K++ E+ + + N + +D+ + KD+ S E +K
Sbjct: 555 TVAPTSTKAVEVEHEKKMTPLKVEKEIAASLDSSEIKNSLPAVDEPSSVKDEVSEETPQK 614
Query: 180 WWQKEREVFEGRIQSFTAR 198
E F ++Q A+
Sbjct: 615 ------EAFGSKVQKRVAQ 627
>sp|A1KSU5|PYRC_NEIMF Dihydroorotase OS=Neisseria meningitidis serogroup C / serotype 2a
(strain ATCC 700532 / FAM18) GN=pyrC PE=3 SV=1
Length = 344
Score = 34.7 bits (78), Expect = 4.7, Method: Composition-based stats.
Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 7/119 (5%)
Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQDILFLVHGE---VTDPEID----IFDREAAFIERVMKPVLAQVPNLKVVFEHITT 176
>sp|Q89A45|MUTY_BUCBP A/G-specific adenine glycosylase OS=Buchnera aphidicola subsp.
Baizongia pistaciae (strain Bp) GN=mutY PE=3 SV=1
Length = 351
Score = 34.3 bits (77), Expect = 5.2, Method: Composition-based stats.
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 736 MITFGKIFCKKRSPNCGACPLRGECKHF 763
+I G + C K +P C CPL+ CK +
Sbjct: 182 LIDIGALICLKSNPKCNICPLKSTCKSY 209
>sp|A0R567|MUTY_MYCS2 A/G-specific adenine glycosylase OS=Mycobacterium smegmatis (strain
ATCC 700084 / mc(2)155) GN=mutY PE=3 SV=1
Length = 293
Score = 34.3 bits (77), Expect = 5.4, Method: Composition-based stats.
Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 25/136 (18%)
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
RL E I E+ VP D + LL + G+G + V + Q VD NV R+
Sbjct: 86 RLHECAVVIASEYDDVVPRDV--DTLLTLPGIGAYTARAVACFAYQASVPVVDTNVRRVV 143
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPV----MDKIQMYLWPRLCYLDQKTLYELHYQMITFG 740
R +H + P +D + L P T ++ G
Sbjct: 144 TRA-------------VHGAADAPASTRDLDMVAALLPP------DTTAPTFSAALMELG 184
Query: 741 KIFCKKRSPNCGACPL 756
C RSP CG CPL
Sbjct: 185 ATVCTARSPRCGICPL 200
>sp|P44319|END3_HAEIN Endonuclease III OS=Haemophilus influenzae (strain ATCC 51907 / DSM
11121 / KW20 / Rd) GN=nth PE=3 SV=1
Length = 211
Score = 34.3 bits (77), Expect = 6.3, Method: Composition-based stats.
Identities = 37/157 (23%), Positives = 61/157 (38%), Gaps = 27/157 (17%)
Query: 606 IKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVR 665
IK G N A I + L+E H VP ++ E L + G+G K+ V
Sbjct: 76 IKTIGLFNSKAENIIKTCRDLIEKHNG-------EVPENR--EALEALAGVGRKTANVVL 126
Query: 666 LLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLD 725
+ H VD ++ R+ R + + + + LLK P K+ ++ W
Sbjct: 127 NTAFGHPTIAVDTHIFRVCNRTNFAAGKDVV-KVEEKLLKVVPNEFKVDVHHW------- 178
Query: 726 QKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+I G+ C R P CG+C + C++
Sbjct: 179 ----------LILHGRYTCIARKPRCGSCIIEDLCEY 205
>sp|P54137|NTH1_CAEEL Probable endonuclease III homolog OS=Caenorhabditis elegans
GN=nth-1 PE=1 SV=2
Length = 298
Score = 33.9 bits (76), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 27/112 (24%), Positives = 47/112 (41%), Gaps = 19/112 (16%)
Query: 650 LLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L + G+G K V ++ + + VD +V RI+ RLGW+ +
Sbjct: 149 LCALPGVGPKMANLVMQIAWGECVGIAVDTHVHRISNRLGWI---------------KTS 193
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+K Q L L + +++ ++ FG++ C+ P CG C R C
Sbjct: 194 TPEKTQKALE---ILLPKSEWQPINHLLVGFGQMQCQPVRPKCGTCLCRFTC 242
>sp|P31378|NTG1_YEAST Mitochondrial DNA base excision repair N-glycosylase 1
OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
GN=NTG1 PE=1 SV=1
Length = 399
Score = 33.9 bits (76), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 50/106 (47%), Gaps = 5/106 (4%)
Query: 3 RLDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLW--FTAKSRGSQKIIQVERN-GLPTQK 59
RL +L K ++ + +Q R Q ++W PKGLW G +II RN G Q
Sbjct: 267 RLTKLWKWVDAQKCKTPDQTRT-QLQNWLPKGLWTEINGLLVGFGQIITKSRNLGDMLQF 325
Query: 60 MPPRYSR-GLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLPLQC 104
+PP R L D++ QL + +I+ K + +G +L ++
Sbjct: 326 LPPDDPRSSLDWDLQSQLYKEIQQNIMSYPKWVKYLEGKRELNVEA 371
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.132 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 420,498,180
Number of Sequences: 539616
Number of extensions: 18434944
Number of successful extensions: 47585
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 124
Number of HSP's that attempted gapping in prelim test: 47289
Number of HSP's gapped (non-prelim): 340
length of query: 1105
length of database: 191,569,459
effective HSP length: 128
effective length of query: 977
effective length of database: 122,498,611
effective search space: 119681142947
effective search space used: 119681142947
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 67 (30.4 bits)